Improving the Extraction of Clinical Concepts from Clinical Records

نویسندگان

  • Xiao Fu
  • Sophia Ananiadou
چکیده

Essential information relevant to medical problems, tests, and treatments is often expressed in patient clinical records with natural language, making their processing a daunting task for automated systems. One of the steps towards alleviating this problem is concept extraction. In this work, we proposed a machine learning-based named entity recognition system to extract clinical concepts from patient discharge summaries and progress notes without the need for any external knowledge resources. Three preand post-processing methods were investigated, i.e. truecasing, abbreviation disambiguation, and distributional thesaurus lookup, the individual annotation results of which were combined into a final annotation set using two refinement schemes. While truecasing and abbreviation disambiguation capture the inflectional morphology of words, the distributional thesaurus lookup allows for statistics-based similarity matching. We achieved a maximum F-score of 0.7586 and 0.8444 for exact and inexact matching, respectively. Our results show that truecasing and annotation combination are the enhancements which best increase the system performance, whereas abbreviation disambiguation and distributional thesaurus lookup bring about insignificant improvements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clinical Governance in Primary Care Principles, Prerequisites and Barriers: A Systematic Review

Introduction: Primary care organizations are the entities through which clinical governance is developed at local level. To implement clinical governance in primary care, awareness about principles, prerequisites and barriers of this quality improvement paradigm is necessary. The aim of this study is to pool evidence about implementing clinical governance in primary care organizations. Data so...

متن کامل

Linguistic and semantic annotation for information extraction and characterization

The 2010 I2B2 NLP challenge concentrated on extraction of three types of information from clinical records: medical concepts, their certainty status and relations between them. For participation in this challenge, we designed an automatic NLP system exploiting terminological resources and a rule-based approach. An attemp was also made to apply knowledge engineering methods. Our system provides ...

متن کامل

Bidirectional LSTM-CRF for Clinical Concept Extraction

Automated extraction of concepts from patient clinical records is an essential facilitator of clinical research. For this reason, the 2010 i2b2/VA Natural Language Processing Challenges for Clinical Records introduced a concept extraction task aimed at identifying and classifying concepts into predefined categories (i.e., treatments, tests and problems). State-of-the-art concept extraction appr...

متن کامل

Automatic medical concept extraction from free text clinical reports, a new named entity recognition approach

Actually in the Hospital Information Systems, there is a wide range of clinical information representation from the Electronic Health Records (EHR), and most of the information contained in clinical reports is written in natural language free text. In this context, we are researching the problem of automatic clinical named entities recognition from free text clinical reports. We are using Snome...

متن کامل

The Obstacles and Improving Strategies of Clinical Education from the Viewpoints of Clinical Instructors in Tehran’s Nursing Schools

Introduction: The situation of clinical education should be always evaluated in order to improve its quality. This study was performed to determine the viewpoints of clinical instructors about obstacles of clinical teaching and strategies for improving its quality in Tehran Nursing Schools in 2005. Methods: In this cross-sectional survey 60 instructors of Tehran’s nursing and midwifery schools...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014